Markov decision process

Results: 537



# Item
11 Competing with Humans at Fantasy Football: Team Formation in Large Partially-Observable Domains

Source URL: www.intelligence.tuc.gr

Language: English - Date: 2012-04-19 16:26:14
12 Around Inverse Reinforcement Learning and Score-based Classification. Matthieu Geist, IMS - MaLIS Research Group (Supélec), Metz, France

Source URL: www.metz.supelec.fr

Language: English - Date: 2014-01-18 03:53:53
13 Increasing the Action Gap: New Operators for Reinforcement Learning. Marc G. Bellemare, Georg Ostrovski, Arthur Guez, Philip S. Thomas, and Rémi Munos. Google DeepMind, {bellemare,ostrovski,aguez,munos}@google.com

Source URL: psthomas.com

Language: English - Date: 2015-12-12 00:05:18
14 Event-Driven Power Management of Portable Systems. Tajana Simunic, Luca Benini

Source URL: seelab.ucsd.edu

Language: English - Date: 2012-06-04 16:50:20
15 Thompson Sampling for Learning Parameterized Markov Decision Processes. Aditya Gopalan. JMLR: Workshop and Conference Proceedings vol 40:1–38, 2015

Source URL: jmlr.org

Language: English - Date: 2015-07-20 20:08:36
16 Where to Park? Minimizing the Expected Time to Find a Parking Space. Igor Bogoslavskyi, Luciano Spinello

Source URL: www.ipb.uni-bonn.de

Language: English - Date: 2016-05-03 10:28:09
17 Stochastic Processes, Markov Chains, and Markov Models Finite-State

Source URL: cl.indiana.edu

Language: English - Date: 2015-09-21 15:09:08
18 Using Plan-Based Reward Shaping To Learn Strategies in StarCraft: Broodwar. Kyriakos Efthymiadis, Daniel Kudenko

Source URL: eldar.mathstat.uoguelph.ca

Language: English - Date: 2016-07-12 12:05:04
19 Adaptive Multi-Agent Programming in GTGolog. Alberto Finzi and Thomas Lukasiewicz

Source URL: www.kr.tuwien.ac.at

Language: English - Date: 2006-05-24 10:11:01
20 Budgeted Classification-based Policy Iteration, presented by Victor Gabillon

Source URL: victorgabillon.nfshost.com

Language: English - Date: 2015-07-14 00:09:21